Aiding the User Input to Virtual Training Environments: Virtual Role Players with Speech and Gesture Recognition
Authors
Abstract
The purpose of this thesis is to address the fact that users' input to training systems in virtual environments is not suited to their natural skills and abilities, which include speaking and gesturing with their bodies. This mismatch may have negative effects on their use of the virtual environment. The first assumption guiding this thesis is that allowing users to interact with the system in the same way they interact with real people would increase immersion. The second assumption is that multimodal input can increase users' performance in the training scenario, especially with respect to habitual and physical skills. The third assumption is that, although people routinely use the mouse and keyboard as input to computers, natural speech and gestures would make military virtual training systems easier to learn and use. The fourth assumption is that more natural systems may increase the amount of training that trainees can transfer to the real world. To show the potential of multimodal input, two prototype systems were created. The design and evaluation of the first prototype are described; it was intended to demonstrate the potential of gesture recognition and multimodal fusion under both ideal theoretical circumstances and controlled, but more realistic, ones. The primary problem with the first prototype was found to be the limitations of its hand recognition and tracking system. The design of the second prototype is then described. This prototype is a fully operational virtual checkpoint training system with multimodal input, built on the hand-tracking and other insights from the first prototype. The results of a demonstration at a conference are then presented, including the effects of environmental factors on its usage. The thesis ends with a discussion of the insights from the last prototype and of future work, including implementation ideas, empirical studies, and general guidelines for multimodal system design.
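The abstract mentions combining gesture recognition with multimodal fusion but does not specify a mechanism. The sketch below is a minimal, hypothetical illustration of late (decision-level) fusion, assuming each recognizer emits timestamped, scored hypotheses; the class names, time-window rule, and scoring are illustrative assumptions, not the thesis's actual design.

```python
# A minimal sketch of late multimodal fusion, assuming speech and gesture
# recognizers each emit timestamped, scored hypotheses. All names and the
# time-window rule are hypothetical, not the thesis's implementation.
from dataclasses import dataclass
from typing import Optional

@dataclass
class Hypothesis:
    label: str        # e.g. "stop" or "raise_palm"
    score: float      # recognizer confidence in [0, 1]
    timestamp: float  # seconds since the session started

def fuse(speech: Hypothesis, gesture: Hypothesis,
         max_gap: float = 1.5) -> Optional[dict]:
    """Combine one speech and one gesture hypothesis into a single command.

    If the two events are close enough in time, return a joint command whose
    confidence is the product of the unimodal scores; otherwise fall back to
    whichever single modality is more confident.
    """
    if abs(speech.timestamp - gesture.timestamp) <= max_gap:
        return {
            "command": f"{speech.label}+{gesture.label}",
            "confidence": speech.score * gesture.score,
        }
    best = max(speech, gesture, key=lambda h: h.score)
    return {"command": best.label, "confidence": best.score}

# Example: the user says "stop" while raising a flat palm.
print(fuse(Hypothesis("stop", 0.9, 12.3), Hypothesis("raise_palm", 0.8, 12.6)))
```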
Similar resources
Human Computer Interaction Using Vision-Based Hand Gesture Recognition
With the rapid emergence of 3D applications and virtual environments in computer systems, the need for a new type of interaction device arises. This is because traditional devices such as the mouse, keyboard, and joystick become inefficient and cumbersome within these virtual environments. In other words, the evolution of user interfaces shapes the change in Human-Computer Interaction (HCI). In...
A CAVE Based 3D Immersive Interactive City with Gesture Interface
3D city models have greatly changed the way we interact with geographic information. However, both the visualization and interaction are limited on conventional 2D displays. This paper presents a system that visualizes 3D cities and supports gesture interaction in a fully immersive Cave Automatic Virtual Environment (CAVE). The proposed system utilizes gestures to control navigation, selection,...
Real Time Hand Gesture Recognition System for Dynamic Applications
Virtual environments have long been considered a means for more visceral and efficient human-computer interaction across a diverse range of applications. The spectrum of applications includes analysis of complex scientific data, medical training, military simulation, phobia therapy, and virtual prototyping. Evolution of ubiquitous computing, current user interaction approaches with keyboard...
VType: Entering Text in a Virtual World
We have developed a software tool, VType, that enables a user wearing virtual reality gloves to enter text while in a virtual world. We present techniques to convert noisy data representing finger movements from the gloves into cleaner signals and determine the finger presses. Since each finger press corresponds to more than one symbol, we then use an algorithm for resolving ambiguity on such overloa...
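The VType snippet describes resolving ambiguity when each finger press is overloaded with several symbols. The sketch below illustrates one common way such ambiguity can be resolved, ranking candidate words against a small lexicon in the spirit of T9 predictive text; the finger-to-letter mapping, lexicon, and function names are invented for the example and are not VType's actual algorithm.

```python
# A hedged illustration of lexicon-based disambiguation of overloaded presses.
# The finger-to-letter assignment and lexicon are hypothetical examples.
from itertools import product

FINGER_LETTERS = {          # hypothetical overloaded assignment per finger
    1: "abc", 2: "def", 3: "ghi", 4: "jkl",
    5: "mno", 6: "pqr", 7: "stu", 8: "vwxyz",
}
LEXICON = {"go": 10, "in": 8, "hi": 5, "no": 7}  # word -> relative frequency

def decode(presses: list[int]) -> str | None:
    """Return the most frequent lexicon word matching a finger-press sequence."""
    candidates = ("".join(letters)
                  for letters in product(*(FINGER_LETTERS[p] for p in presses)))
    matches = [w for w in candidates if w in LEXICON]
    return max(matches, key=LEXICON.get) if matches else None

print(decode([3, 5]))  # "go" and "in" both match; frequency picks "go"
```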
Publication date: 2014